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DNA construct and its use. 
The present invention relates to a new DNA construct for transformation into oilseed 
plants. The DNA construct comprises nucleotide sequences encoding peptides with enzyme 
activities necessary for the high-level production and esterification of keto group-containing 
xanthophylls in oilseed plants. 

Background of the invention 
Carotenoids are produced de novo by plants, fungi, algae and some bacteria. A 
number of biosynthetic steps are needed for the biological production of the carotenoids. 
There are two chemically different groups of carotenoids, namely carotenes containing only 
carbon and hydrogen molecules and xanthophylls containing oxygen in the molecule in 
addition to carbon and hydrogen. 

The xanthophylls, and particularly astaxanthin (3,3'-dihydroxy-p-p-carotene-4,4'- 
dione), are often colored pigments and are used as such or as anti-oxidants. 

Carotenes are biological precursors for the production of the oxygen-containing 
xanthophylls. There are two types of enzymes responsible for the introduction of hydroxy 
groups and keto groups into the carotenes, namely hydroxylases and kctolases, respectively. 

The keto group-containing xanthophyll astaxanthin, which has keto and hydroxy 
groups, is biosynthetically produced from beta-carotene. 

Large-scale production of xanthophylles from natural sources is at present performed 
by AstaCarotene AB, Gustavsberg, Sweden, by cultivation of the alga Haematococcus 
pluvialis for the production of astaxanthin in esterified form. 

It would be desirable to be able to produce keto group-containing xanthophylls 
particularly astaxanthin, in oilseed plants. Oilseed plants have naturally P-carotene 
hydroxylases but lack p-carotene C-4-oxygenase enzymes or ketolases. 

Description of the invention 
The present invention provides DNA constmcts enabling and promoting 
production of keto group containing xanthophylls, especially astaxanthin, in oilseed plants, 
such as rape, sunflower, soybean and mustard. The DNA construct is transformed into the 
oilseed plant cell for expression of a protein or fused protein which has an enzyme activity 
enabling keto group insertion into a carotene or hydroxy carotene for the biosynthetic 
production of a keto group containing xanthophyll, such as cantaxanthin (P,P-carotene-4,4'- 
dione) and/or astaxanthin. Use is thus made of the biosynthetic pathway of the oilseed plant to 
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produce carotenoids. The naturally occurring synthesis of carotenoids involves a number of 
enzymes, namely 1-D-deoxyxylulose 5-phosphate synthase, isopentenyl 
pyrophosphate:dimethylallyl pyrophosphate isomerase, geranylgeranyl pyrophosphate 
synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, lycopene beta- 
cyclase, P-carotene hydroxylase, and p-carotene C-4-oxygenase. Genes coding for peptides 
having these enzymatic activities may be inserted into the DNA construct of the invention, 
one or several per construct, to promote high-level production in the transgenic oilseed plant. 
In case only one enzyme coding gene is inserted per plant, two or more plants may be 
sexually interbred to produce plants containing all the desired enzyme activities. 

Thus, the present invention is directed to a DNA construct comprising in the 5' to 3' 
direction of transcription operably linked a promoter region directing transcription to the seed 
of an oilseed plant, a nucleotide sequence coding for at least one peptide with enzyme activity 
necessary for keto group containing xanthophyll production and esterification in an oilseed 
plant and a transcriptional teraiination region. 

In a preferred embodiment of the invention the DNA construct additionally 
comprises between the promoter region and the nucleotide sequence coding for at least one 
peptide with enzyme activity a nucleotide sequence coding for a transit peptide directing the 
translated fusion polypeptide to the chloroplast of the oilseed plant 

The DNA construct is preferably such that the promoter is a napin promoter, the 
peptide with enzyme activity necessary for keto group containing xanthophyll production is 
selected from the group consisting of peptides with 1-D-deoxyxylulose 5-phosphate synthase, 
isopentenyl pyrophosphate.dimethylallyl pyrophosphate isomerase, geranylgeranyl 
pyrophosphate synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, 
lycopene beta-cyclase, p-carotene hydroxylase, and P-carotene C-4-oxygenase activity. To 
promote esterification of astaxanthin a nucleotide sequence coding for a peptide with acyl 
transferase activity may be included in the group. 

In a preferred embodiment of the DNA construct according to the invention the 
nucleotide sequence coding for a peptide with enzyme activity is a nucleotide sequence 
coding for a N-terminally truncated P-carotene C-4-oxygenase gene from the alga 
Haematococcus pluvialis. 

An example of the DNA construct of the invention is presented in the sequence 
listing as SEQ ID NO: 1 and in Fig.l . 
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The present invention is also directed to a transgenic oilseed plant cell 
comprising the DNA construct of the invention, and preferably the oilseed plant is selected 
from the group consisting of rape, sunflower, soybean and mustard. 

The invention is additionally directed to transgenic oilseed plant-produced 
5 xanthophyll, e.g. canthaxanthin and astaxanthin. 

A preferred aspect of the invention is directed to transgenic oilseed plant- 
produced astaxanthin esters. 

The present invention will now be illustrated with reference to the DNA 
construct disclosed in the sequence listing and in Fig.l, and the following description of 
10 embodiments. However, the invention is not limited to these exemplifications. 

Short description of the drawings 
Fig,l illustrates the nucleotide sequence of the DNA construct comprising the napin promoter, 
the chloroplast localization signal, the N-terminally truncated P-carotene C-4-oxygenase gene 
and the termination sequence, and the deduced amino acid sequences of the transit peptide 
15 and the p-carotene C-4-oxygenase. 

Description of embodiments 
The invention is illustrated by production of astaxanthin in the seed of oilseed 
rape. The astaxanthin produced in the seed of the transgenic plant is extracted as part of the 
extracted oiL By use of conventionally used protocols for Agrobacterium tumefaciens 
20 mediated transformation such as described by (Hoekema et al. 1 983, An et al. 1 986, Fry et al. 
1987, DeBlock et al. 1988, Radke et al.l988, or Moloney et al. 1989) transgenic plants are 
produced having a chimeric DNA construct that is genetically inherited and is able to produce 
astaxanthin. The nucleotide sequence of the chimeric DNA construct consist of four parts of 
different genetic origin namely: (1) a promoter, (2) a localization signal, (3) a p-carotene C-4- 
25 oxygenase coding region and (4) a termination sequence. 

The napin promoter directs transcription to the seed of oilseed rape (St&lberg et 
al 1996). This promoter was coupled to a localization signal similar but not identical to a 
transit peptide (TP) of Rbcsla (Krebbers, 1988) that directs the translated product of a fused 
gene to the chloroplast. The promoter and the TP sequence were ligated to a part of the coding 
30 sequence of a ketolase gene BCK (Kajiwara et al. 1995). This enzyme oxygenates p-carotene 
to canthaxanthin, (Fraser et al. 1997). The chimeric DNA construct was then coupled to a 
suitable termination sequence, e.g. that of the Agrobacterium tvunefaciens nopaline synthase 
gene (the nos 3' end)(Bevan et al. 1983), as illustrated in Fig.l. 
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Cellular storage of Astaxant^ i^ 

The storage of large amounts of free astaxanthin in plants will be difficult due to 
toxic effects of the molecule as it intercalates in the plant membranes. An effective 
esterification of astaxanthin to fatty acids enables storage of the esterified molecules in 
triacylglycerol containing oleosomes. Thus, an acyl transferase can be claimed to be of 
fundamental importance for the process, as is proteins that can mediate transport of different 
forms of astaxanthin from the chloroplast to the vesicles. 
Sequences and oligonucleotides us ed in the construction of the DNA construct 

1. Napin promoter (GeneBank ACCESSION No. J02798) 

This promoter sequence, a 1145 base pair fragmait including the 5' leader 
sequence has a unique Hindffl site at the 5' end. The 3' end was synthesized with an 
additionally 6 nucleotide BamHI site. 

2. Transit peptide similar to RBCSla (GeneBank ACCESSION No. XI 361 1. XI4565) 

The transit peptide (TP) was amplified by PGR from -28 to the end of the transit 
cleavage aa=54/55 site of the Rbcsla gene. The 5' end was synthesized with a BamHI site 
and similarly the 3 * sequence was synthesized with a Xbal site. The two following 
oligonucleotides were used for the PGR amplification. 

BamHI 

5' primer: TPl 5'AGAC GGATCC TCAGTCACACAAAGAGTA 3' 

Sad Xbal 

3 ' primer: TP2 5 'GTTC G AGCTC TCTAGA CATGCAGTTAACGC 3 ' 

3. BCK (fi-carotene C-4 oxygenase) (Genebank ACCESSION No. D4588I) 

The BCK fragment was amplified by PGR including a 5' Xbal site and was 
ligated to the TP already described. The 5' primer (BCKl) used for PGR, is homologous to 
the BGK sequence from nucleotide 264 and the 3* oligonucleotide (Ax40) ends with a stop 
codon and was synthesized with a Sad restriction site for cloning. The synthesized fragment 
was fused to the TP as shown in Fig 1. 
Oligonucleotides used for PGR: 

Xbal 

5 ' primer: BGKl 5 ACAG TCTAGA ATGGGATGGGAGTGGTCA 3 ' 

Sad 

3 'primer: AX40 5 CACCG AGCTCGATGAGAGTGTTGTGGAGA 3 ' 
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Description of SEQ ID NO:l and SEQ ID NO:2 

The sequences shown i Fig.l are the same as the two sequences which are 
shown in the sequence listing. 

The SEQ ID N0:1 is a nucleotide sequence composed of the following features: 
5 Nucleotide No. 



Cloning site HindUI 1 -6 



15 



10 



Napin Promoter 1 -1 145 

Cloning site BamHI 1 146-1 151 

Transit peptide leader 1 152-1 1 78 

Transit peptide coding 1 1 79- 1 347 

Cloning site Xbal 1348-1353 

P-carotene C-4-oxygenase 1 354-22 1 7 
p-carotene C-4-oxygense 3' xmtranslated 2218-2266 

Cloning site Sad 2267-2272 

Nopaline synthetase termination 2273-2536 

Cloning site EcoRI 2538-2543 



The SEQ ID NO: 2 is a deduced amino acid sequence of the fusion protein of 
the transit peptide and the peptide with p-carotene C-4-oxygenase activity. 
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Claims 

!• A DNA construct comprising in the 5' to 3' direction of transcription operably 
linked a promoter region directing transcription to the seed of an oilseed plant, a nucleotide 
sequence coding for at least one peptide with enzyme activity necessary for keto group 
5 containing xanthophyll production and esterification in an oilseed plant and a transcriptional 
termination region. 

2. The DNA construct according to claim 1, which between the promoter region 
and the nucleotide sequence coding for at least one peptide with enzyme activity additionally 
comprises a nucleotide sequence coding for a transit peptide directing the translated fusion 

10 polypeptide to the chloroplast of the oilseed plant. 

3. The DNA construct according to claim 1 or 2, wherein the promoter is a napin 
promoter, the peptide with enzyme activity necessary for keto group containing xanthophyll 
production and esterification is selected from the group consisting of peptides with, 1-D- 
deoxyxylulose 5-phosphate synthase, isbpentenyl pyrophosphateidimethylallyl pyrophosphate 

15 isomerase, geranylgeranyl pyrophosphate synthase, phytoene synthase, phytoene desaturase, 
zeta-carotene desaturase, lycopene beta-cyclase, p-carotene hydroxylase, P-carotene C-4- 
oxygenase, and acyl transferase activity. 

4. The DNA construct according to any one of claims 1-3, wherein the 
nucleotide sequence coding for a peptide with enzyme activity is a nucleotide sequence 

20 coding for a N-tcrminally truncated p-carotene C-4-oxygenase gene from the alga 
Haematococcus pluvialis. 

5. The DNA construct according to claim 4, wherein the nucleotide sequence is 
SEQIDNOrl. 

6. Transgenic oilseed plant cell comprising the DNA construct of any one of 

25 claims 1*5 . 

7. Transgenic oilseed plant cell according to claim 6, wherein the oilseed plant is 
selected from the group consisting of rape, sunflower, soybean and mustard. 

8. Transgenic oilseed plant-produced xanthophyll. 

9. Transgenic oilseed plant-produced xanthophyll according to claim 8, wherein 
30 the xanthophyll is canthaxanthin 

10. Transgenic oilseed plant-produced xanthophyll according to claim 8, 
wherein the xanthophyll is astaxanthin. 

11. Transgenic oilseed plant-produced xanthophyll according to claim 8, 
wherein the xanthophyll is astaxanthin esters. 



wo 01/20011 



PCT/SEOO/01767 



1/3 

Napin promoter 

AAGCTTTCTTCaLTCGGTGATTGATTCCOTTAAAGACTTATGTTTCTTATCTTGCTTCTGA 

GGCAAGTATTCAGTTACCAGTTACCACTTATATTCTGGACTTTCTGACTGCATCCTCATT 

TTTCCAAa^TTTTAAATTTCACTATTGGCTGAATGCTTCTTCrTTGAGGAAGAAAOlATT 

CAGATGGCAGAAATGTATCAACCAATGCATATATACAAATGTACCTCTTGTTCTCAAAAC 

ATCTATCGGATGGTTCCATTTGCTTTGTCATCCAATTAGTGACTACTTTATATTATTCAC 

TCCTCTTTATTACTATTTTCATGCGA6GTTGCCAT6TACATTATATTTGTAAGGATTGAC 

GCTATTGAGCGTTTTTCTTCAATTTTCTTTATTTTAGACATGGGTATGAAATGTGTGTTA 

GAGTTGGGTTGAATGAGATATACGTTCAAGTGAAGTGGCATACCGTTCTCGA6TAAGGAT 

GACCTACCCATTCTTGAGACAAATGTTACATTTTAOTATCAGAGTAAAATGTGTACCTAT 

AACTCiUUlTTCGATTGACATGTATCCULTTC^ACATAAAATTAAACCAGCCTGCACCT 

TCCACATTTCAAGTATTTTCAAACCGTTCGGCTCCTATCCACCGGGTGTAACAAGACGGA 

TTCCGAATTT66AAGATTTTGACTCAAATTCCCAATTTATATTGACCGTGACTAAATCAA 

CTTTAACTTCTATAATTCTGATTAAGCTCCCAATTTATATTCCCAACGGCACTACCTCCA 

AAATTTATAGACTCTCATCCCCTTTTAAACCAACTTAGTAAACGTTTTTTTTTTTAATTT 

TATGAAGTTAAGTTTTTACCTTGTTTTTAAAAA6AATC6TTCATAAGATGCCATGCCAGA 

ACATTAGCTACACGTTACACATAGCATGCAGCCGCGGAGAATTGTTTTTCTTCGCCACTT 

GTCACTCCCTTCAAACACCTAAGAGCTTCTCTCTCACAGCACACACATACAATCACATGC 

GTGCATGCATTATTACa^CGTGATCGCCATGCAAATCTCCTTTATAGCCTATAAATTAACT 

CATCCGCTTCACTCTTTACTCAAACCAAAACTCATCAATACAAACAAGATTAAAAACATA 

End -28 untranslated leader TP start 

CACGAGGATCCTCAGTCACACAAAGAGTAAAGAAGAACAATGGCTTCCTCTATGCTCTCT 

M A S S M L S 

TCCGCTACTATGGTTGCCTCTCCGGCTCAGGCCACTATGGTC6CTCCTTTCAACGGACTT 
SATMVASPAQATMVAPFNGL 

AAGTCCTCCGCTGCCTTCCCAGCCACCC6CAAGGCTAACAACGACATTACTTCCATCACA 
KSSAAFPATRKANNDIT SIT 



FIG.l 
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TP End C-4-Oxygenase 
AGCAACGGCGGACGCGTTAACTGCATGTCTAGAATGCCATCCGAGTCGTCAGACGCAGCT 
SNG6RVNCMSRMPSESSDAA 

CGTCCTGCGCTAAAGCACGCCTACAAACCTCCAGCATCTGACGCCAAGGGCATCACGATG 
RPALKHAYKPPASDAKGITM 

GCGCTGACCATCATTGGCACCTGGACCGCAGTGTTTTTACACGCAATATTTCAAATCAGG 
ALTI IGTWTAVFLHAIFQIR 

CTACCGACATCCATGGACCAGCTTCACTGGTTGCCTGTGTCCGAAGCCACAGCCCAGCTT 
LPT SMD QLHW LPVSEATAQL 

TTGGGCGGAAGCAGCAGCCTACTGCACATCGCTGCAGTCTTCATT6TACTTGAGTTCCTG 
L66SSSLLHIAAVFIVLEFL 

TACACTGGTCTATTCATOACCACACATGACGaUlTGCATGGCACCyiTAGCTTTGAGGC^ 
YTGLFITTH DAMHGTIALRH 

AGGCAGCTCAATGATCTCCTTGGCAACATCTGCATATCACTGTACGCCTGGTTTGACTAC 
RQLNDLL6NICISLYAWFDY 

AGCATGCTGCATCGCAAGCACTGGGAGCACCACAACCATACTGGCGAAGTGGGGAAAGAC 
SMLHRKHWEHHNHTGEV6KD 

CCTGACTTCCACAAGGGAAATCCCGGCCTTGTCCCCTGGTTCGCCAGCTTCATGTCCAGC 
PDFHKGNPGLVPWFASFMSS 

TACATGTCCCTGTG6CAGTTTGCCCGGCTGGCATGGTGGGCAGTGGTGATGCAAATGCTG 
YMSLWQFARLAWWAVVMQML 

GGGGCGCCCATGGCyU^TCTCCTAGTCTTCATGGCTGCAGCCCCAATCTTGTCAGC^ 
GAPMANLIiVFMAAA P 1 L SAF 

CGCCTCTTCTACTTCGGCACTTACCTGCCACACAAGCCTGAGCCAGGCCCTGCAGCAGGC 
RLFYF6T YLPHKPEPG PAAG 

TCTCAGGTGATGGCCTGGTTCAGGGCCAAGACAAGTGAGGCATCTGATGTGATGAGTTTC 
SQVMAWFRAKTSSASDVMS F 

CTGACATGCTACCACTTTGACCTGCACTGGGAGOlCCACaiGATGGCCCTTTGCCCCCTGQ 
LTCYHFDLHWEHHRWPFAPW 

C-4 oxygenase Stop 
TGGCAGCTGCCCCACTGCCGCCGCCTGTCCGGGCGTGGCCTGGTGCCTGCCTTGGCATGA 
WQLPHCRRLSGRGLVPALA* 



FIG.l (cont.) 
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C-4 oxygenase untranslated region Nos term 

CCTGGTCCCTCCGCTGGTGACCCAGCGTCTGCACAAGAGTGTCATGGAGCTCGAATTTCC 

CCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAATCCT 

CGATGATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAATAATTAACATGTAAT 

GCSITGACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATOT 

ACGCGATAGAAAACAAAATATAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTCAT 
end 

CTATGTTACTAGATCGGGAATTC 



Pig.l (cont.) 
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SEQUENCE liISTING 



<110> AstaCarotene AB 



<120> DNA construct and its use 

<130> 2 92 95 -AstaCarotene 

<140> 
<141> 

<160> 2 

<170> Patent In Ver- 2.1 

<210> 1 
<211> 2543 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: napin promoter 

choroplast localization signal + beta-carotene C-4 oxygenase 
coding sequence + termination sequence 

<220> 

<221> promoter 
<222> (1) (1145) 

<220> 

<221> transit_peptide 
<222> (1179) . . (1347) 

<220> 
<221> CDS 

<222> (1179) . . (2217) 
<220> 

<221> terminator 
<222> (2273) (2536) 

<400> 1 



aagctttctt 


catcggtgat 


tgattccttt 


aaagacttat 


gtttcttatc 


ttgcttctga 


60 


ggcaagtatt 


cagttaccag 


ttaccactta 


tattctggac 


tttctgactg 


catcctcatt 


120 


tttccaacat 


tttaaatttc 


actattggct 


gaatgcttct 


tctttgagga 


agaaacaatt 


180 


cagatggcag 


aaatgtatca 


accaatgcat 


atatacaaat 


gtacctcttg 


ttctcaaaac 


240 


atctatcgga 


tggttccatt 


tgctttgtca 


tccaattagt 


gactacttta 


tattattcac 


300 


tcctctttat 


tactattttc 


atgcgaggtt 


gccatgtaca 


ttatatttgt aaggattgac 


360 


gctattgagc 


gtttttcttc 


aattttcttt 


attttagaca 


tgggtatgaa 


atgtgtgtta 


420 


gagttgggtt 


gaatgagata 


tacgttcaag 


tgaagtggca 


taccgttctc gagtaaggat 


480 


gacctaccca 


ttcttgagac 


aaatgttaca 


ttttagtatc 


agagtaaaat 


gtgtacctat 


540 
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aactcaaatt cgattgacat gtatccattc aacataaaat taaaccagcc tgcacctgca 600 

tccacatttc aagtattttc aaaccgttcg gctcctatcc accgggtgta acaagacgga 660 

ttccgaattt ggaagatttt gactcaaatt cccaatttat attgaccgtg actaaatcaa 720 

ctttaacttc tataattctg attaagctcc caatttatat tcccaacggc actacctcca 780 

aaatttatag actctcatcc ccttttaaac caacttagta aacgtttttt tttttaattt 840 

tatgaagtta agtttttacc ttgtttttaa aaagaatcgt tcataagatg ccatgccaga 900 

acattagcta cacgttacac atagcatgca gccgcggaga attgtttttc ttcgccactt 960 

gtcactccct tcaaacacct aagagcttct ctctcacagc acacacatac aatcacatgc 1020 

gtgcatgcat tattacacgt gatcgccatg caaatctcct ttatagccta taaattaact 1080 

catccgcttc actctttact caaaccaaaa ctcatcaata caaacaagac taaaaacata 1140 

cacgaggatc ctcagtcaca caaagagtaa agaagaaca atg get tec tct atg 1194 

Met Ala Ser Ser Net 
1 5 

etc tct' tec get act atg gtt gcc tct ecg get cag gee act atg gtc 1242 
Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala Gin Ala Thr Met Val 
10 15 20 

get cct ttc aac gga ctt aag tec tee get gee tte cca gcc acc ege 1290 
Ala Pro Phe Asn Gly Leu Lye Ser Ser Ala Ala Phe Pro Ala Thr Arg 
25 30 35 

aag get aac aac gac att act tec ate aca age aac ggc gga cgc gtt 1338 
Lys Ala Asn Asn Asp lie Thr Ser lie Thr Ser Asn Gly Gly Arg Val 
40 45 50 

aac tgc atg tct aga atg cca tec gag teg tea gac gea get cgt cct 1386 
Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro 
55 60 65 

gcg eta aag cae gee tac aaa cct cca gea tct gac gcc aag ggc ate 1434 
Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly He 
70 75 80 85 

aeg atg gcg ctg acc ate att ggc acc tgg acc gea gtg ttt tta cac 1482 
Thr Met Ala Leu Thr He He Gly Thr Trp Thr Ala Val Phe Leu His 
90 95 100 

gea ata ttt caa ate agg eta ceg aca tec atg gac cag ctt cac tgg 1530 
Ala He Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His Trp 
105 110 H5 

ttg cct gtg tec gaa gcc aca gcc cag ctt ttg ggc gga age age age 1578 
Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser 
120 125 130 

eta ctg cac ate get gea gtc ttc att gta ctt gag tte ctg tac act 1626 
Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr Thr 
135 140 145 
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ggt eta ttc 
Gly Leu Phe 


ate 
He 


acc 
Thr 


aca 
Thr 
155 


eat 
His 


gae 
Asp 


gca 
Ala 


atg 
Met 


cat 
His 
160 


ggc 
Gly 


acc 
Thr 


ata 
He 


get 
Ala 


ttg 
Leu 
165 


1674 


agg cac agg 
Arg His Arg 


cag 
Gin 


etc 
Leu 

i /U 


aat gat 
Asn Asp 


etc 
Leu 


ctt ggc 
Leu Gly 
175 


aac 
Asn 


ate 
He 


tgc 
Cys 


ata 
He 


tea 
Ser 
180 


Ctg 
Leu 


1722 


tac gcc tgg 
Tyr Ala Trp 


ttt 
Phe 

1 DC 

185 


gae 
Asp 


tac 
Tyr 


age 
Ser 


atg 
Met 


ctg 
Leu 
190 


eat 
His 


cgc 
Arg 


aag 
Lys 


cac 
His 


tgg 
Trp 
195 


gag 
Glu 


cac 
His 


1770 


cac 
His 


aac 
Asn 


cat 
His 
200 


act 
Thr 


ggc 
Gly 


gaa 
Glu 


gtg 
Val 


ggg aaa gae 
Gly Lys Asp 
205 


cct 
Pro 


gae 
Asp 


ttc 
Phe 
210 


cac 
His 


aag 
Lys 


gga 
Gly 


1818 


aat 
Asn 


ccc ggc 
Pro Gly 
215 


ctt 
Leu 


gtc 
Val 


ccc 
Pro 


tgg 

Trp 
220 


ttc 
Phe 


gcc 
Ala 


age 
Ser 


ttc 
Phe 


atg 
Met 
225 


tec 
Ser 


age 
Ser 


tac 
Tyr 


atg 
Met 


1866 


tec 
Ser 
230 


ctg 
Leu 


tgg 
Trp 


cag 
Gin 


ttt 
Phe 


gcc egg 
Ala Arg 
235 


ctg 
Leu 


gca 
Ala 


tgg 
Trp 


tgg 
Trp 
240 


gca 
Ala 


gtg 
Val 


gtg 
Val 


atg 
Met 


caa 
Gin 
245 


1914 


atg 
Net 


ctg ggg 
Leu* Gly 


gcg 
Ala 


ccc 
Pro 
250 


atg gca 
Met Ala 


aat 
Asn 


etc 
Leu 


eta 
Leu 
255 


gtc 
Val 


ttc 
Phe 


atg 
Met 


get 
Ala 


gca 
Ala 
260 


gee 
Ala 


1962 


cca 
Pro 


ate 
He 


ttg 
Leu 


tea 
Ser 
265 


gca 
Ala 


ttc cgc 
Phe Arg 


etc ttc 
Leu Phe 
270 


tac 
Tyr 


ttc 
Phe 


ggc 
Gly 


act 
Thr 


tac 
Tyr 
275 


ctg 
Leu 


cca 
Pro 


2010 


cac 
His 


aag 
Lys 


cct 
Pro 
280 


gag 

Glu 


cca 
Pro 


ggc cct 
Gly Pro 


gca gca ggc 
Ala Ala Gly 
285 


tet 
Ser 


cag 
Gin 


gtg 

Val 
290 


atg 
Met 


gcc 
Ala 


tgg 
Trp 


2058 


ttc 
Phe 


agg 
Arg 
295 


gcc 
Ala 


aag 
Lys 


aca 
Thr 


agt 
Ser 


gag 
Glu 
300 


gca 
Ala 


tet gat 
Ser Asp 


gtg 

Val 


atg 
Met 
305 


agt 
Ser 


ttc 
Phe 


ctg 
Leu 


aca 
Thr 


2106 


tgc 
Cys 
310 


tac 
Tyr 


cac 
His 


ttt 
Phe 


gae 
Asp 


ctg 
Leu 
315 


cac 
His 


tgg 
Trp 


gag 
Glu 


cac 
His 


cac 
His 
320 


aga 
Arg 


tgg 
Trp 


ccc 
Pro 


ttt 
Phe 


gee 
Ala 
325 


2154 


ccc tgg 
Pro Trp 


tgg 
Trp 


cag 
Gin 


ctg 
Leu 
330 


ccc 
Pro 


cac 
His 


tgc 
Cys 


cgc cgc 
Arg Arg 
335 


ctg 
Leu 


tec 
Ser 


ggg 

Gly 


egt 
Arg 


ggc 

Gly 
340 


ctg 
Leu 


2202 



gtg cct gee ttg gca tgacetggtc cctccgctgg tgacccagcg tctgcacaag 2257 
Val Pro Ala Leu Ala 
345 

agtgtcatgg agctcgaatt tccccgatcg ttcaaacatt tggcaataaa gtttcttaag 2317 
attgaatcct gttgeeggtc ttgegatgat tatcatataa tttctgttga attacgttaa 2377 
geatgtaata attaacatgt aatgeatgae gttatttatg agatgggttt ttatgattag 2437 
agtcccgeaa ttatacattt aatacgegat agaaaacaaa atatagegcg caaactagga 2497 
taaattatcg cgcgcggtgt catctatgtt actagatcgg gaattc 2543 
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<210> 2 
<211> 346 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: deduced fusion protein of 

transit peptide + peptide with beta-carotene C-4 oxygenase activity 

<400> 2 

Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 

15 10 15 

Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp lie Thr Ser He Thr Ser 
35 40 45 

Asn Gly Gly Arg Val Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser 
50 55 60 

Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser 
65 70 75 80 

Asp Ala Lys Gly He Thr Met Ala Leu Thr He He Gly Thr Trp Thr 
85 90 95 

Ala Val Phe Leu His Ala He Phe Gin He Arg Leu Pro Thr Ser Met 
100 105 110 

Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu 
115 120 125 

Gly Gly Ser Ser Ser Leu Leu His He Ala Ala Val Phe He Val Leu 
130 135 140 

Glu Phe Leu Tyr Thr Gly Leu Phe He Thr Thr His Asp Ala Met His 
145 150 155 160 

Gly Thr He Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn 
165 170 175 

He Cys He Ser Leu Tyr AXa Trp Phe Asp Tyr Ser Met Leu His Arg 
180 185 190 

Lys His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro 
195 200 205 

Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe 
210 215 220 

Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp 
225 230 235 240 



Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val 
245 250 255 
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Phe Met 



Ala 



Ala 
260 



Ala 



Pro 



lie 



Leu 



Ser 
265 



Ala 



Phe 



Arg 



Leu 



Phe 
270 



Tyr 



Phe 



Gly Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser 
275 280 285 

Gin Val Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val 

290 295 300 

Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu His His 
305 310 315 320 

Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His Cys Arg Arg Leu 
325 330 335 

Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 



340 



345 
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